Fusing microarray experiments with multivariate regression

نویسندگان

  • Walter R. Gilks
  • Brian D. M. Tom
  • Alvis Brazma
چکیده

MOTIVATION It is widely acknowledged that microarray data are subject to high noise levels and results are often platform dependent. Therefore, microarray experiments should be replicated several times and in several laboratories before the results can be relied upon. To make the best use of such extensive datasets, methods for microarray data fusion are required. Ideally, the fused data should distil important aspects of the data while suppressing unwanted sources of variation and be amenable to further informal and formal methods of analysis. Also, the variability in the quality of experimentation should be taken into account. RESULTS We present such an approach to data fusion, based on multivariate regression. We apply our methodology to data from a previous study on cell-cycle control in Schizosaccharomyces pombe. AVAILABILITY The algorithm implemented in R is freely available from the authors on request.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Transcription Factor Binding Site Prediction with Multivariate Gene Expression Data by Nancy

Multi-sample microarray experiments have become a standard experimental method for studying biological systems. A frequent goal in such studies is to unravel the regulatory relationships between genes. During the last few years, regression models have been proposed for the de novo discovery of cis-acting regulatory sequences using gene expression data. However, when applied to multi-sample expe...

متن کامل

Transcription Factor Binding Site Prediction with Multivariate Gene Expression Data

Multi-sample microarray experiments have become a standard experimental method for studying biological systems. A frequent goal in such studies is to unravel the regulatory relationships between genes. During the last few years, regression models have been proposed for the de novo discovery of cis-acting regulatory sequences using gene expression data. However, when applied to multi-sample expe...

متن کامل

On the gene ranking of replicated microarray time course data

Consider the gene ranking problem of replicated microarray time course experiments where there are multiple biological conditions, and genes of interest are those whose temporal profiles are different across conditions. We derive the multi-sample multivariate empirical Bayes statistic for ranking genes in the order of differential expression, from both longitudinal and cross-sectional replicate...

متن کامل

Ensemble Logistic Regression for Feature Selection

This paper describes a novel feature selection algorithm embedded into logistic regression. It specifically addresses high dimensional data with few observations, which are commonly found in the biomedical domain such as microarray data. The overall objective is to optimize the predictive performance of a classifier while favoring also sparse and stable models. Feature relevance is first estima...

متن کامل

lmdme: Linear Model on Designed Multivariate Experiments in R

The lmdme package implements analysis of variance (ANOVA) decomposition through linear models on designed multivariate experiments in R (R Development Core Team, 2012), allowing ANOVA-principal component analysis (APCA) and ANOVA-simultaneous component analysis (ASCA). It also extends both methods with the application of partial least squares (PLS) through the specification of a desired output ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 21 Suppl 2  شماره 

صفحات  -

تاریخ انتشار 2005